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Abstract 

Understanding how organisms interact with their environments is increasingly 
important for conservation efforts in many species, especially in light of highly 
anticipated climate changes. One method for understanding this relationship is 
to use genetic maps and QTL mapping to detect genomic regions linked to phe- 
notypic traits of importance for adaptation. We used high-throughput genotyp- 
ing by sequencing (GBS) to both detect and map thousands of SNPs in haploid 
Chinook salmon {Oncorhynchus tshawytscha). We next applied this map to detect 
QTL related to temperature tolerance and body size in families of diploid Chi- 
nook salmon. Using these techniques, we mapped 3534 SNPs in 34 linkage 
groups which is consistent with the haploid chromosome number for Chinook 
salmon. We successfully detected three QTL for temperature tolerance and one 
QTL for body size at the experiment-wide level, as well as additional QTL signifi- 
cant at the chromosome-wide level. The use of haploids coupled with GBS pro- 
vides a robust pathway to rapidly develop genomic resources in nonmodel 
organisms; these QTL represent preliminary progress toward linking traits of 
conservation interest to regions in the Chinook salmon genome. 



Introduction 

Impacts of global climate change on aquatic species are dif- 
ficult to anticipate (Crozier et al. 2008). The warming of 
aquatic ecosystems may impact physiology, migration, and 
food webs; and organisms and populations are likely to 
react with both plastic and adaptive responses (Lundsg- 
aard-Hansen et al. 2013; Piou and Prevost 2013). Thus far 
however, the role of adaptation in ecological responses to 
anthropogenic change has largely been ignored (Smith and 
Bernatchez 2008). Studies of genetic and physiological 
mechanisms of adaptation are needed to predict climate 
effects on ecosystems and inform conservation manage- 
ment (Portner and Farrell 2008; Whitehead 2012). Individ- 
ual populations within species of Pacific sahnon exhibit 
diverse responses to climate regimes (Crozier and Zabel 
2006; Eliason et al. 2011; Rogers et al 2013), indicating 
allelic variation for genes that mediate adaptively important 
physiological pathways. 



Climate change and predicted oceanic temperature 
increases may result in substantial reductions in habitat for 
Pacific salmon (Schindler et al. 2008). Projected habitat 
losses are largest in the Gulf of Alaska and western and sub- 
arctic North Pacific Ocean. Nearly complete loss of Gulf of 
Alaska habitat for some species in some seasons, especially 
Chinook salmon (Oncorhynchus tshawytscha) in summer, 
raises important policy issues for fishery managers and 
governments ( Abdul- Aziz et al. 2011). 

The impacts of climate changes in freshwater habitats are 
expected to be even more pronounced than those in marine 
habitats: freshwater fish may have fewer options to avoid 
temperatures that are above optimal (or lethal) than wiU 
marine fish (Breau et al. 2011; Sutton and Soto 2012). 
Reproductive success and even survival of spawning adults 
(Farrell 2009; Evans et al 2011; Jeffries et al. 2012) and 
growth and behavior of juveniles may vary greatly in sub- 
stantially altered thermal regimes (Sykes et al. 2009; Sutton 
and Soto 2012). Thermal increases are indicated in the 
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emergence of new diseases in freshwater that threaten natu- 
ral populations (Kocan et al. 2004, 2009; Kocan and Hersh- 
berger 2006). Genetic data on thermal tolerance in 
salmonid species are slowly accumulating (Matala et al. 
2011; Quinn et al. 2011; Anttila et al. 2013), but a more 
thorough and comparative approach is needed to provide 
insight into individual and population differences and to 
potentially inform conservation management (Whitehead 
2012). 

Chinook salmon is a cornerstone species with key eco- 
nomic and ecological impacts throughout the North Pacific 
Ocean and its freshwater drainages. Chinook salmon are 
anadromous and semelparous; as a result, they are a major 
conduit for transferring nutrients from the marine environ- 
ment to support freshwater and terrestrial ecosystems. Chi- 
nook salmon are caught in commercial, sport, and 
subsistence fisheries in North America and Asia, and they 
are an important food source for marine mammals includ- 
ing the endangered killer whale Orcinus orca in the North- 
east Pacific Ocean (Williams et al. 2011). Uncertain fate in 
the face of climate change is considerably greater for organ- 
isms such as Chinook salmon that have complex migratory 
life cycles where selection pressures might differ greatly 
between marine and freshwater life stages (Crozier et al. 
2008; Schindler et al. 2008). 

Genotyping by sequencing is a relatively new approach 
(Baird et al. 2008; Hohenlohe et al. 2010; Davey et al. 
2011) that has rapidly accelerated gene discovery and the 
acquisition of large quantities of genomic data, facilitating 
genome mapping and QTL studies in nonmodel organisms 
(Barchi et al. 2012; Houston et al. 2012; Keller et al. 2013). 
Miller et al. (2012) used GBS to create a dense meiotic map 
in rainbow trout O. mykiss and detected a large haplotype 
block responsible for rate of embryo development; Narum 
et al. (2013) interrogated 10 000 random SNPs in rainbow 
trout and detected several that were associated with sur- 
vival under thermal stress. The use of dense meiotic maps 
has proven to be a powerful framework for study of geno- 
mic architecture and spatial structure of divergence in nat- 
ural populations (Bradbury et al. 2013; Gagnaire et al. 
2013b). 

Our primary objective was to use GBS to create a dense 
meiotic map for Chinook salmon to provide a template for 
later comparative mapping, QTL mapping, and population 
genomics studies. We used a single gynogenetic haploid 
family to map novel SNPs and organized them into linkage 
groups, consistent with the haploid chromosome number 
for Chinook salmon. Annotations were unremarkable; 
however, testing for QTL in diploid families revealed ther- 
mal tolerance signals isolated to LG 11, LG 16, and LG 34. 
Additionally, several growth-associated QTL were isolated 
to other linkage groups. We believe that this combination 
of GBS, efficient mapping using haploids, and QTL 
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analyses in diploids will provide a robust pathway to rap- 
idly develop genomic resources for Chinook salmon as well 
as other nonmodel organisms and provide a tool for identi- 
fying vulnerable populations to inform conservation man- 
agement. 

Methods 

Animals 

The project design was to conduct meiotic mapping on one 
haploid family and QTL mapping in one pair of diploid 
half-sib families. However, we designed the matings with 
adequate redundancy, following Institutional Animal Care 
and Use Committee protocol 4229-01, to guarantee that 
sufficient haploid individuals were available given the 
uncertainty of the treatment. 

Matings were created from individuals in the University 
of Washington hatchery population over the course of 
2 weeks. Six females and three males were used to make 
pairs of half-sib families (see Table 1 for half-sib mating 
strategy). A portion of the gametes from each of the 
females was also combined with UV-irradiated sperm from 
one of the Chinook salmon males or a coho salmon male 
to create gynogenetic haploid families for genetic mapping 
(cf, Thorgaard et al. 1983; Spruell et al. 1999). Fin clips 
from all parents were collected in ethanol for DNA analy- 
sis. 

Haploid embryos from the six families were incubated 
for 2 weeks and then sampled into ethanol. Sampling was 
performed just prior to scheduled hatch because haploid 
salmonids do not survive the hatching process. Scheduled 
hatching time, based upon incubation temperature, was 
predicted using the software IncubWin (http://www.pac. 
dfo-mpo.gc.ca/science/aquaculture/sirp/incubwin-eng.html). 

The diploid families were reared for a total of 10 weeks 
after hatching. At that time, they were subjected to the 
temperature challenge for QTL analysis. 



Table 1. Mating design for the creation of haploid and diploid families. 



Male parent 


Female parent 


Male parent 


(diploids) 


(haploids and diploids) 


(haploids) 


M5 


F9 


Coho M3 




F10 




M6 


F11 


Coho M7t 




F12 




M7* 


F13* 






F14*t 





QTL, quantitative trait loci. 

*Parents of the diploid, half-sib families used for QTL detection, F13, 
and F14. 

fParents of the haploid family used for creation of the linkage map, 
F14H. 
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Temperature challenge 

Individuals were starved for 48 h prior to the temperature 
challenge. Twenty-four hours prior to the start of the 
experiment, the tank water temperature was allowed to 
gradually rise from 12 to 18°C. At the start of the trial, the 
temperature was raised from 18°C to 25 ± 1°C over the 
course of 1 h by adding heated water to the system. This 
temperature was then maintained for the remainder of the 
trial, and the individuals were monitored for loss of equi- 
librium. As soon as an individual lost equilibrium, it was 
euthanized in MS-222 and recorded as a temperature-sen- 
sitive individual. After 50 temperature-sensitive individuals 
were sampled from each family, an additional 50 'tempera- 
ture tolerant' individuals were immediately sampled and 
the thermal challenge ended. The total time from initiation 
of the temperature challenge until the 50 temperature toler- 
ant individuals were sampled from all three pairs of half- 
sib families was approximately 8 h. Sampled individuals 
were measured for fork-length and weight, and the whole 
caudal fin from each individual was preserved in ethanol 
for DNA analysis. 

DNA extraction, genotyping, and sequencing 

All DNA extractions were carried out using DNEasy-96 kits 
(Qiagen, Valencia, CA, USA) following manufacturer's 
directions. A subsample of each axillary process from adults 
or whole caudal fin from diploid offspring was added to the 
lysis buffer directly. Whole haploid embryos were dissected 
from the yolk and chorion and added to the lysis buffer. 

To inform the selection of a family for mapping and pro- 
vide validation of the UV treatment in haploid embryos, an 
initial 95 haploid embryos from each family (Table 1) were 
screened for variation at a panel of 96 EST loci using the 5' 
nuclease reaction, following the methods of Seeb et al. 
(2009). We wanted to select families to maximize the num- 
ber of segregating ESTs to provide anchors on the map; 
also the absence of male alleles at EST loci was a confirma- 
tion of haploidy. All of the 5' nuclease assays used in this 
study originated from Smith et al. (2005a,b), Campbell 
and Narum (2008), Clemento et al. (2011), and Larson 
et al (2013). 

Haploid family F14H was selected for mapping because 
it had the largest number of segregating ESTs and few non- 
haploid offspring. (This choice also determined the pair of 
half-sib families, F14 and F13, for QTL analysis.) Then, an 
additional 142 putative haploids from haploid family F14H 
were geno typed; 213 true haploids identified in these two 
steps were retained for further analyses. Finally, an 
additional 50 EST loci were genotyped in the true haploid 
offspring. All segregating EST loci were included in the 
linkage map. 
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Restriction-site-associated DNA (RAD) sequencing, fol- 
lowing the methods of Baird et al. (2008), was carried out 
on all haploid embryos, diploid progeny, and nine adults 
that included the three parents of the half-sib families 
(F13 X M7; F14 x M7) and the six additional parents 
from the families reared for the thermal challenge 
(Table 1). All nine adults underwent paired-end sequenc- 
ing to provide longer templates to improve annotation 
(Table SI). All other samples were processed using single- 
end sequencing. 

Genomic DNA from each individual was cut with the 
restriction enzyme Sbfl. Next, barcoded lUumina adapters 
were ligated to the cut site. Each individual received a 
unique barcode within each library, allowing individuals to 
be separated for data analysis. Groups of up to 24 diploids 
or 48 haploids (Table 1) were pooled, and the pooled DNA 
was sheared. For single-end libraries, DNA was sheared to 
around 500-bp fragments, while for paired-end libraries, 
DNA was sheared to around 200-bp fragments. The 
sheared DNA was cleaned up with a Qiagen MinElute kit, 
and a second lUumina adapter was added. The two lUu- 
mina adapters are necessary for fragment amplification, 
allowing selective amplification of only those fragments 
which contain a Sbfl restriction site. Finally, the libraries 
were amplified with PGR, cleaned up, and sequenced. All 
libraries were sequenced on an lUumina HiSeq 2000 at the 
University of Oregon, USA. 

Sequence analysis 

All RAD sequence analysis and genotyping were carried out 
using the freely available software program Stacks (Gatchen 
et al. 2011). 

First, the library that included all parents was quality-fil- 
tered and all individuals separated based on their individ- 
ual barcodes using the process_radtags command. 
Process_radtags was set to discard reads with uncalled bases 
or low-quality scores. Next, SNP discovery was carried out 
only using the RAD sequences from the female from family 
F14; the sequences from the additional adults were retained 
for later paired-end assembly and analysis (see below). The 
ustacks command was run on the female parent using 
default settings, allowing the program to resolve over 
merged tags and discard highly repetitive sequences. Next, 
the cstacks command was run with default setting on the 
ustacks output from the female parent. The resulting cata- 
log was used for all other alignments (see below). Finally, 
sstacks was run on the female using default settings and the 
catalog created in the previous step. 

Restriction-site-associated DNA genotyping of the hap- 
loid offspring was also carried out using Stacks. Haploid 
libraries from F14 were run through the pipeline {pro- 
cessjradtags, ustacks, sstacks) using the same procedures as 
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carried out on the female parent. The resulting output, 
along with the output and catalog from the female parent, 
were used with the Stacks genotypes command. Genotypes 
were set to exclude any marker that was genotyped in < 160 
individuals (80% of total number of haploids) and provide 
generic output. The resulting genotype output was opened 
and converted to R/qtl (Broman et al. 2003) format for 
linkage mapping in Microsoft Excel. 

Restriction-site-associated DNA genotyping of the dip- 
loid offspring along with the two additional parents from 
the half-sib family was carried out using Stacks and the 
same procedures as the haploid offspring and parents (pro- 
cess_radtags, ustacks, sstacks, and genotypes). The resulting 
genotypes were converted into the correct formatting for 
GRiDQTL in Microsoft Excel. 

Linkage mapping 

Map construction was carried out using R/qtl (Broman 
et al. 2003), using the parameters for doubled haploids. 
The genotypes from the EST assays were added to the data- 
set from RAD sequencing. Markers with significant segre- 
gation distortion (chi-squared test, P < 10^'°) were 
removed from all further analysis. Recombination fractions 
and LOD scores were calculated using the esf.rf command. 
After this step, markers with no recombination were identi- 
fied, and all but a single marker were removed from initial 
grouping. Initial linkage group formation was carried out 
using the formLinkageGroups command with an LOD 
threshold of 6 and a maximum recombination fraction of 
0.35. Groups were examined using plot.rf. Initial marker 
order was calculated with orderMarkers, using the Kosambi 
map function. Order was confirmed using ripple with the 
maximum-likelihood algorithm and a window size of 6. 
Any discrepancies in marker order discovered through 
either examination of the recombination plot or ripple 
were corrected using switch.markers. Markers with no 
recombination were added back to the map, and the final 
map was exported as a text file and figures were examined 
using the software program mapchart. 

Paired-end assembly and annotation 

Assembly of the paired-end sequences was carried out to 
improve annotation of markers included in the map. 
Paired-end sequences from the nine parents were assem- 
bled using the paired-end module provided as part of the 
Stacks program and the freely available software program. 
Velvet (Zerbino and Birney 2008) (v. 1.1.06). First, ustacks 
was run on each of the remaining eight parents [the addi- 
tional five females and three males from the half-sib fami- 
lies initially selected for rearing (see above)]. Next, the list 
of SNPs corresponding to the genetic map was used with 
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the paired-end module from Stacks on the ustacks output 
and paired-end sequence files, providing sets of corre- 
sponding paired-end sequences for each mapped RAD tag. 
Velvet was run on this output using default parameters and 
a minimum contig length of 150 bp. Resulting contigs were 
compared to the SwissProt database (release 2012_10) 
using BLASTX and annotated if they matched with an e-value 
of 10^* or less. The resulting annotations were added to 
the linkage map. 

Comparison with other species 

Sequences from each RAD tag included in the map were 
compared to RAD tags from rainbow trout O. mykiss 
(Miller et al. 2012) and sockeye salmon O. nerka (Everett 
et al. 2012) using stand-alone blast (v. 2.2.25). Hits that 
matched along the entire length of either the trout or 
sockeye tag (trout and sockeye tags were shorter than the 
Chinook tags because of the sequencing chemistry used 
in those studies) and contained no more than two mis- 
matches were retained. Positions of the tags in each spe- 
cies map were examined for synteny among the three 
species. 

Analysis of variance on phenotypes 

The diploid offspring from each female (F13 x M7; 
F14 X M7) were reared in separate tanks, introducing a 
confounding tank effect in growth related traits. A one-way 
analysis of variance (anova) was carried out on the length 
and weight traits to account for this effect, and the residu- 
als from this analysis were used in the QTL analysis for 
length and body weight in the paternal half-sibs. 

QTL analysis for temperature tolerance, length, and 
weight 

QTL analysis was carried out using the online software pro- 
gram GRIDQTL (Seaton et al. 2006), which utilizes the half- 
sib linear regression model developed by Knott et al. 
(1996). This model used is yij = a,+hiXij+e where is the 
phenotype of individual; from sire or dam i, a,- mean effect 
for half-sib family i, fo, is regression coefficient within half- 
sib family i, is the probability of inheriting a parental 
allele, and eij is the residual error. The data sets were ana- 
lyzed two ways using the half-sib modules in the software. 
First, the fuU paternal half-sib families were analyzed using 
the standard parameters for all three traits included, tem- 
perature tolerance, length, and weight. Next, the offspring 
from each female were analyzed separately, still using the 
half-sib module, classifying the dam as the shared parent to 
examine female-linked traits. A single QTL per linkage 
group was specified for both analyses. 
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The analysis was carried out both across all of the 34 
linkage groups (experiment wide) and within each linkage 
group (chromosome wide), f -statistics were calculated at a 
1-cM interval on each linkage group, and the f -threshold 
for significance was determined via a permutation test with 
10 000 replicates, using the method developed by Churchill 
and Doerge (1994) as executed in the gridqtl software 
(Seaton et al. 2006). _F-thresholds for 'significant' QTL were 
set at P < 0.05 at the experiment-wide level and P < 0.01 
at the chromosome-wide level. F-thresholds for P < 0.05 at 
the chromosome-wide level are considered 'suggestive'. 
The percentage of phenotypic variance explained (PVE) by 
each QTL detected was calculated as PVE = 4[l-(MSEfuu/ 
MSEreduced)] where MSEfuu and MSEreduced are the mean 
squared error of the full model and mean squared error of 
the reduced model (parameters fixed), respectively (Knott 
et al. 1996). The 95% confidence intervals for the position 
of each QTL were determined using 10 000 bootstraps with 
resampling in the gridqtl software. 

Results 

Sequencing 

Prior to sequencing, all putative haploid offspring from F14 
and parents (F14, F13, M7) were successfully genotyped with 
the EST assays (Table SI). Genotyping showed that 25 of the 
putative haploids were diploids, and these were excluded 
from further analysis. RAD sequencing was successfully car- 
ried out on the remaining 213 haploid embryos as well as all 
nine adults (F14, F13, M7, and six additional adults, see 
Methods) and the 200 diploid half-sibs offspring of F14, 
F13, and M7. After quality filtering, between 729 487 and 
6 685 959 reads (average 2 830 029 ± 1 427 321) were 
obtained for each individual (Table S2). 

SNP discovery and haploid genotyping 

Single nucleotide polymorphism discovery in the female parent 
of the haploids, F14, produced 1 1 427 putative SNP loci. These 
loci were filtered using genotypes from the haploid 
individuals to remove PSVs and genotyping errors (see below). 

Genotyping was successfully carried out on all haploid 
offspring of F14 using the catalog of all 11 427 putative 
SNPs discovered in the female. These genotypes went 
through a two-stage screening process. First, markers that 
were fixed for the same allele in all haploid individuals 
(erroneous heterozygous calls in the female parent) were 
removed, leaving more than 7000 putative SNPs. Next any 
RAD marker missing genotypes in more than 53 of the 213 
individuals (25%) was removed. Stacks defaults to doubled 
haploid output when only a single parent is included in the 
catalog; in this format, heterozygous genotype calls are 
converted to missing calls, so filtering RAD markers miss- 



ing more than 25% of individuals removes both true errors 
and PSVs. After these filters, 4905 segregating SNPs 
remained. These were combined with the genotype data 
from 91 EST assays for further analysis. After combining, 
the complete data set was screened to remove 1461 markers 
with significant segregation distortion (chi-square 
P < 10"'°). After these filters, 3535 RAD and EST SNPs 
remained for linkage mapping. 

Linkage mapping 

Linkage mapping was successfully carried out using the 
doubled haploid module in R/qtl. A minimum LOD score 
of 6 and a maximum recombination fraction of 0.35 were 
used for the construction of all linkage groups. Initial link- 
age analysis identified a single marker with an erroneous 
linkage relationship. This marker was removed from fur- 
ther analysis, and the linkage relationships were recalcu- 
lated. The final map contained 3534 SNPs, arranged in 34 
linkage groups and a single unlinked marker. Each linkage 
group contained between 24 and 193 SNPs. Linkage groups 
ranged in size from 27.75 to 160.23 cM, with an average 
marker spacing of between 0.39 and 1.78 cM (Fig. 1, 
Table 2, Table S3). The total female map length was 
2483.41 cM. Each linkage group also contained blocks of 
between two and 15 markers with no detectable recombi- 
nation (Table S3, Fig. 1). 

Using Stacks and Velvet, 3462 contigs were successfully 
assembled from the paired-end sequences. These contigs 
were compared to the SwissProt database using blastx, and 
hits with an e-value <10"* were retained. In this manner, 
426 SNPs were successfully annotated (Table S4). 

The 426 annotated RAD tags include a variety of biologi- 
cal processes and when possible were assigned GQSlim 
terms. Nearly half (209/426) of all annotations were trans- 
posable elements. The remaining annotations included 
genes involved in transcription (18), transport of proteins 
and ions (19), cell differentiation (11), development (6), 
and others (16) (Table S3). The remaining nontransposable 
element sequences were not assigned a process using 
GOSlim terms. These proteins may have unknown func- 
tions or their functions are not included in those included 
in the GOSlim definitions. 

Restriction-site-associated DNA tags included in the 
map were successfully compared to previously mapped 
RAD tags from sockeye salmon (Everett et al. 2012) and 
rainbow trout (Miller et al. 2012). Forty-one tags were 
shared between sockeye salmon and Chinook salmon, and 
81 tags were shared between rainbow trout and Chinook 
salmon (Table 3). These matches were spread throughout 
the genomes of all three species. 

The final linkage map was used for QTL analysis with the 
diploid half-sibs. Diploid individuals were successfully 
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LG-8 



Table 2. Linkage summary. The number of markers, length, and aver- 
age marker spacing of each of the 34 Chinook salmon linkage groups. 



24.0—, 
24.5 



^34254 12107 19432_TC1A 89350 7763 
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- 24000 

^ 110031 
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H 24430 13252 
H20159 13548 



^ 12754 
^ 19275 
— 3950 



H80574 14058 
H43119 ANK2 42903 



-3959 

H104630_SUH 16179_TCB2 

J 24399 15943 6589 8241_TC3A 11366 

126359 

H60170 74816 

-62112 

H 73782 65102 4586 100580 37559 
H20211 33428_TCB2 
H60927 50144_TCB1 49720 90626 90246 
458943 33265 

^7835 87701_RENT1 25544 95175 
^51093_TCB1 

(J 791 90 48242 6771 27043 102344 
12023 6769 

12772 
S28530 99754 

H69786 67271 33855_TC1A 8084 
J3960_PDE7A 96377 

J31063 43880 49752 55560_KI10A 88600 
'123387 75896 

HOts_110495_308 61959 30047 95496 
^20344_TC1A 
^ 52356 
-160903 58276 

25170 35386 97559 8840 2278_LAMA3 
*J 1 1 635 46564 46759 80563 98833 
|40624 14899 4075 

- 85422 

H43439 60293 74618_TC1A 
H31624_TCB1 42942 55203 

-31219 

^13900 66144 

-52175 

^ 103712 

- 10455 

^46532_SCC4 5423 

- 27303 

151186 19461 95778 Ots_99550_204 11416 
^J3383 35963 46749_TC1A 50431 53587 
154814 68657 78559 86863 92099 



Figure 1 Linkage group 8 from the RAD based map for Chinook sal- 
mon. Markers beginning with 'Ots_' are from B'nuclease assays. Mark- 
ers which have a gene ID following the RAD number were annotated 
using the SwissProt database. RAD, Restriction-site-associated DNA. 



genotyped for the 3472 SNPs included in tlie linkage map 
using Stacks. QTL detection was carried out at both the 
experiment-wide and chromosome-wide level. At the 
experiment-wide level, a P-value of 0.05 was significant, 
and at the chromosome-wide level, a P- value of 0.01 was 
significant for QTL and 0.05 was suggestive for QTL, simi- 
lar to the procedure applied in Gutierrez et aL (2012). 





Number of 




Average marker 


Linkage group 


markers 


Length (cM) 


spacing (cM) 


1 


182 


100.06 


0.55 


2 


167 


160.23 


0.96 


3 


193 


96.39 


0.50 


4 


179 


86.1 1 


0.48 


5 


165 


97.51 


0.59 


6 


151 


86.07 


0.57 


7 


1 53 


105.34 
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g 


144 


95.35 


0.66 


g 


135 


98.75 


0.73 


10 


142 


87.41 


0.62 


1 1 
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0.78 


12 


1 26 


49.53 
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13 


124 


87.09 
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14 
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15 
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16 


88 
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17 


91 
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18 


86 
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0.64 


20 


90 


51 .95 


0.58 


21 


84 


52.63 


0.63 


22 


107 


62.72 


0.59 


23 


85 


100.35 


1.18 


24 


85 


54.66 


0.64 


25 


60 


47.03 


0.78 


26 


83 


46.08 


0.56 


27 


58 


47.57 


0.82 


28 


61 


53.16 


0.87 


29 


72 


52.71 


0.73 


30 


47 


56.27 


1.20 


31 


45 


52.74 


1.17 


32 


46 


46.52 


1.01 


33 


24 


42.66 


1.78 


34 


26 


27.78 


1.07 



Thermotolerance QTL 

Within the paternal half-sib families, two male-linked QTL 
were found at the experiment- wide level (Fig. 2A): one 
QTL was significant at the chromosome-wide level, and an 
additional two were suggestive at the chromosome-wide 
level (Table 4). The male-linked QTL significant at the 
experiment-wide level were located on linkage groups 11 at 
74 cM and 16 at 6 cM (Fig. 2A). The chromosome-wide 
QTL was located on linkage group 34 at 13 cM. The two 
suggestive QTL were located on linkage group 26 at 30 cM 
and linkage group 30 at 13 cM (Table 4). These QTL 
account for between 12 percent (suggestive QTL) and 
almost 30 percent (experiment-wide QTL) of the PVE 
(Table 4). Examining the female-linked QTL, there is a sin- 
gle QTL significant at the genome-wide level in female F14, 
located on linkage group 34 at 5 cM. There are an addi- 
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Table 3. Comparison to other species. Linkage group matclies to RAD- 
tag-based maps in rainbow trout O. mykiss and sockeye salmon O. ner- 
(Everett et al. 2012; IVIilleret al. 2012; Paiti et al. 2012). 

Chinook 



Linkage 


Rainbow Trout 


Number 


Sockeye Salmon 


Number 


Group 


Linkage Group 


of hits 


Linkage Group 


of hits 


1 


WS16 


1 


One_2B 


1 


2 










3 


WS01,WS27 


4 


One_28 


1 


4 


WS16, WS19 


4 


One_4, One_1 1 


2 


5 


WS03, WSOB 


3 


One_21, One_24 


3 


6 


WS2B 


3 






7 


WS06 


2 


One_3 


1 


8 


WS04 


3 


One_9 


2 


9 


WS07 


1 


One_13, One_27M 


2 


10 


WS24 


10 






11 


WS1B, WS22 


5 


One_18 


1 


12 


WS18 


2 


One_9, One_13 


2 


13 


WS13 


5 


One_27, One_28 M 


4 


14 


WS23 


4 






IB 


WS17, WS2B 


2 


One_14 


1 


16 


WS26 


3 


One_8 


1 


17 


WS03 


3 


One_4 


1 


18 


WS01 


4 


One_13 


2 


19 


WS11 


2 


One_19 


2 


20 






One_10 


1 


21 


WS09 


3 


One_20 


1 


22 


WS02 


1 


One_1 


1 


23 


WS04, WS16 


2 


One_22 


1 


24 


WS1B 


2 


One_9, One_16 


2 


25 


WS10 


1 


One_7 


1 


26 


WS08 


3 


One 20 


I 


27 


WS08 


2 


One_23 




28 






One_6 


1 


29 


WS10 


1 


One_6 


1 


30 


WS29 


1 


One_21 


1 


31 


WS06 


1 


One_10, 
One_13, 
One_14 


3 


32 


WS12, WS1B 


3 






33 










34 






One_21 


1 


Total 




81 




41 



RAD, Restriction-site-associated DNA. 



tional two suggestive QTL on linkage groups four and 
seven, located at 73 and 63 cM, respectively. These QTL 
accounted for between 34 and 64% of the PVE (Table 4). 
In female F13, no thermo tolerance QTL were discovered at 
the genome-wide level. Additional data on each QTL for 
thermotolerance and body size (see below), including QTL 
effect, can be found in Table S5. 

Body size QTL 

A significant difference in both body weight and length was 
detected between the offspring from the two half-sib fami- 



lies (P < 5.94*10"*'* for body weight, P < 2.83*10"^ for 
length). However, as these families were reared in separate 
tanks, it is impossible to determine whether this is a genetic 
effect or simply a confounding environmental effect. To 
account for this, the residuals from the anova analysis were 
used for male-effect QTL detection. 

Neither male- nor female-linked alleles showed any sig- 
nificant QTL for length at the experiment-wide level. At 
the chromosome-wide level, one QTL was found to be sig- 
nificant for length in the male-linked alleles on linkage 
group 6 at 64 cM. An additional four suggestive QTL were 
found on linkage groups 5, 7, 9, and 14, at positions 69, 95, 
86, and 20 cM (Table 4). The four female-linked QTL for 
length were only suggestive and located on linkage groups 
9, 14, 23, and 34 at 5, 44, 60, and 2 cM. These four QTL 
were all found in F14 (Table 4). These QTL accounted for 
between 14 and 20% of PVE in the male, and 27 and 34 
PVE for female F14. 

A total of eight significant and suggestive QTL were 
found for body weight. Two male-linked QTL were signifi- 
cant at the experiment-wide level (Table 4, Fig. 2). These 
QTL were located on linkage group 6 at 29 cM and linkage 
group 9 at 5 cM. The male had one additional QTL signifi- 
cant at the chromosome-wide level, located on linkage 
group 14 at 20 cM. These QTL accounted for between 19 
and 29 PVE (Table 4). Alleles from both females contained 
QTL for body weight. In F14, there were three suggestive 
QTL on linkage groups 6, 9, and 34 at 14, 5, and 2 cM. In 
F13, there were also two suggestive QTL on linkage groups 
14 and 18 at positions 74 and 7 cM. These accounted for 
between 26 and 33 PVE in F14, and 31 and 32 PVE in F13 
(Table 4). 

Discussion 

Our objective was to use haploids and GBS to rapidly cre- 
ate a dense meiotic map for Chinook salmon to enable 
QTL analyses and anchor later genomic analyses. We suc- 
cessfully mapped over 3500 RAD- and EST-based SNP loci 
in a single haploid family, and we then detected thermal 
tolerance QTL that were isolated to LG 11, LG 16, and LG 
34 in corresponding diploid families. Additionally, as size 
data were readily available, we were able to isolate growth- 
associated QTL to other linkage groups. Clearly the now 
mature GBS platforms, in this case coupled with the use of 
haploids, enable rapid development of genomic resources 
in organisms for which few exist, providing excellent 
opportunities to gain new insights into adaptation and 
genomic architecture (Miller et al. 2012). An improved 
understanding of the genetic basis of temperature tolerance 
is especially important for the management and conserva- 
tion of aquatic species given the expectations of ecosystem 
warming. 
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Map position (cm) 

Figure 2 The distribution of F-values containing QTL significant at tine 
experiment-wide level in the male (M7) and female (F14) parents. (A) F- 
values for thermotolerance in the male. (B) F-values for thermotolerance 
in the female F14. (C) f-values for body weight in the male. The dark 
horizontal line in all three figures is the experiment-wide (P < 0.05) sig- 
nificance threshold determined by a 10 000 permutation test (Churchill 
and Doerge 1994). Vertical lines designate individual linkage groups. 
Linkage group 1 1 on plot A contains two peaks, our model specified 
one QTL per linkage group, so only the higher of the two peaks was 
determined to be a QTL. QTL, Quantitative trait loci. 

We used paired-end sequencing in multiple individuals 
to assemble longer contigs in order to annotate our map. 
Of 3472 RAD markers, 426 (-12%) were successfully 



annotated using the SwissProt database (Table S3). This 
rate of annotation is similar to that in sockeye salmon 
using similar techniques (-22%; Everett et al. 2012). Our 
annotations contained almost 50% transposable elements. 
This is consistent with other studies in salmonids; trans- 
posable elements may have an important role in adaptation 
(Danzmann et al. 2008). Five markers in the region of our 
significant QTL had apparently nonremarkable annota- 
tions: Three were transposons, one a nuclear encoded ribo- 
somal gene, and one was an EST-based SNP, Ots- 104063- 
132 [GenBank HR308694, NADH dehydrogenase 1 beta 
subcomplex subunit 2 (Clemento et al. 2011)]. 

The use of the restriction enzyme Sbfl, commonly used 
for RAD sequencing in salmonids, was also intended to 
increase the rate of annotation (Baird et al. 2008). RAD 
sequencing generates templates randomly spread through- 
out the genome, of which only -1% might be expected to 
lie in coding sequences. The enzyme Sbfl (CCTGCAGG) is 
75% GC and has been shown to cut more frequently in 
coding sequences (Baird et al. 2008; Amores et al. 2011). 
In zebrafish, 11% of Sbfl recognition sites are in protein- 
coding genes, and in stickleback, the figure is 16% (Amores 
et al. 2011). We do not understand why our annotation 
rate for protein-coding genes was somewhat lower, -6% 
excluding transposons, although this rate is slightly higher 
than a similar study by Sanchez et al. (2009) who observed 
only -1.5% in rainbow trout using reduced representation 
libraries constructed with the enzyme Haelll. 

Typically QTL studies of this type require multiple gen- 
erations with the use of inbred or hybrid lines. Those 
experimental designs are incompatible with nonmodel spe- 
cies such as Chinook salmon whose anadromous life his- 
tory makes multigenerational studies extremely difficult. In 
this study, our use of paired haploid and diploid families to 
identify and map thousands of markers allowed us to iden- 
tify QTL within one generation. 

Thermal tolerance QTL have been identified in multi- 
generational study of domesticated salmonids using back- 
cross families derived from captive brood stocks. For 
example, Jackson et al. (1998) identified two QTL associ- 
ated with upper temperature tolerance in backcross families 
of rainbow trout that accounted for between nine and 13% 
of the phenotypic variance. Two significant QTL for upper 
temperature tolerance in Arctic charr (Salvelinus alpinus) 
were found to be on homeologs and in a location similar to 
the QTL markers in the corresponding rainbow trout link- 
age groups (Somorjai et al. 2003). Comparative studies 
such as (Somorjai et al. 2003) provide insight into the fate 
of QTL after the salmonid gene duplication. 

The QTL located on our LG 11, LG 16, and LG 34, which 
correspond to rainbow trout chromosomes 19, 20, and 21 
described in Miller et al (2012) (Tables 3 and 4: Fig. 2), 
were responsible for up to 64% the observed variance in 
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Table 4. Significant QTL. F-values and tinreslnolds for significant QTL at botli tlie clironnosome- and genome-wide levels. Trait 1 is thermotolerance, 
two is length, and three is weight. 

Chromosome- 



Linkage 
Group 


Trait 


Parent 


Position 
(cM) 


F 


wide f-value 
threshold 
(P< 0.05) 


Chromosome-wide 
f-value threshold 
(P< 0.01) 


Experiment-wide 
f-value threshold 
{P< 0.05) 


Significance 


% PVE 


95% C.I. (cM) 


4 


1 


F14 


73 


10.44 


9.71 


13.89 


18.05 




34.82 


2.0-93.0 


5 


2 


M7 


69 


10.34 


8.51 


11.48 


13.96 




17.94 


22.5-95.0 


6 


2 


M7 


64 


11.53 


7.83 


10.78 


13.96 


A 


20.11 


23.0-67.0 


6 


3 


M7 


29 


16.73 


8.37 


11.96 


14.85 




29.31 


4.0-64.0 


6 


3 


F14 


14 


9.11 


8.93 


12.47 


16.29 




30.29 


1.0-78.0 


7 


2 


M7 


95 


10.79 


8.34 


12.33 


13.96 




18.75 


8.0-103.0 


7 


1 


F14 


63 


12.84 


10.13 


13.99 


18.05 




42.73 


17.0-89.0 


9 


2 


M7 


86 


10.19 


8.51 


11.56 


13.96 




17.66 


1 .0-88.0 


9 


2 


F14 


5 


8.43 


8.24 


11.06 


14.04 




27.94 


4.0-92.0 


9 


3 


M7 


5 


15.56 


8.86 


11.85 


14.85 


** 


27.28 


2.0-88.0 


9 


3 


F14 


5 


10.04 


9.09 


12.49 


16.29 




33.46 


4.0-92.0 


11 


1 


M7 


74 


16.98 


6.97 


10.59 


15.37 


** 


29.73 


2.0-74.0 


14 


2 


M7 


20 


8.32 


7.62 


10.48 


13.96 




14.20 


2.0-83.0 


14 


2 


F14 


44 


8.94 


8.57 


11.54 


14.04 




29.69 


1.0-112.0 


14 


3 


M7 


20 


1 1 .36 


7.89 


1 1 .25 


14.85 


A 


19.79 


8.0-53.0 


14 


3 


F13 


74 


9.86 


9.35 


12.76 


15.82 




32.87 


8.0-79.0 


16 


1 


M7 


6 


17.05 


8.32 


11.89 


15.37 


** 


29.86 


1.0-67.0 


18 


3 


F13 


7 


9.53 


8.39 


12.00 


15.82 




31.72 


0.0-35.0 


23 


2 


F14 


60 


9.08 


8.18 


11.18 


14.04 




30.19 


0.0-62.0 


26 


1 


M7 


30 


7.48 


7.30 


10.60 


15.37 




12.61 


0.0^4.0 


30 


1 


M7 


13 


7.65 


6.18 


9.83 


15.37 




12.93 


0.0-53.0 


34 


1 


M7 


13 


11.95 


6.80 


10.42 


15.37 


A 


20.87 


9.0-25.0 


34 


1 


F14 


5 


19.92 


7.79 


11.50 


18.05 


A* 


64.18 


4.0-27.0 


34 


2 


F14 


2 


9.50 


6.99 


9.82 


14.04 




31.64 


1.0-14.0 


34 


3 


F14 


2 


8.08 


7.41 


11.10 


16.29 




26.69 


1.0-21.0 



QTL, quantitative trait loci. 

In this study, loci with a P-value < 0.05 at the experiment-wide level (**) are significant at the experiment-wide level. At the chromosome-wide level, 
loci with a P-value < 0.01 (*) are defined as significant QTL at the chromosome-wide level, while loci with a P-value < 0.05 are suggestive of a QTL. 
Threshold values were determined via a 10 000 permutation test (Churchill and Doerge 1994). Percent PVE is percentage of phenotypic variance 
explained, calculated as PVE = 4[1-(MSEfui/MSEreduced)] where MSEfun and MSErejuced are the mean squared error of the full model and mean 
squared error of the reduced model (parameters fixed), respectively (Knott et al. 1996). The 95% confidence intervals were determined with 10 000 
bootstraps with replacement. 



thermotolerance. The markers associated with the upper 
temperature tolerance in rainbow trout were found on 
rainbow trout chromosomes 9, 19, and 25 (Jackson et al. 
1998; Palti et al. 2012). While two of our QTL do not 
appear to overlap those of rainbow trout from the Jackson 
et al. (1998) study, we did find one corresponding relation- 
ship on chromosome 19. A possible explanation for the 
lack of additional matches could be the differences between 
our maps and studies. Jackson et al. (1998) used an early 
map for rainbow trout, composed of fewer markers, so our 
power to detect QTL will be different. Our experimental 
approaches for determining temperature tolerance were 
also different, and the individuals included in our study are 
not fixed for the traits in question. As a result, QTL in indi- 
vidual families may not be segregating and would not be 
detected. Finally, different genes may be associated with 
temperature tolerance in the two species, or the same genes 



may be on different chromosomes. Recent research in 
Atlantic salmon suggests that there may be large genetic 
variation in thermal tolerance even within a species (Anttila 
et al. 2013). 

We observed large variability in the PVE for each QTL 
among the male and female parents (PVE ranging from 
12-64%). While other studies have a fair amount of vari- 
ability [13-36% (Gutierrez et al. 2012)], none were quite 
this high. This may be an artifact of our method. Qur sam- 
ple size was relatively low, 100 individuals per half-sib fam- 
ily, and used outbred individuals. Gutierrez et al. (2012) 
used 120 individuals per family and included five families. 
Jackson et al. (1998) used between 104 and 144 individuals 
per family, and the families were based on a backcross 
breeding design. In our study, the highest PVE values were 
all in female El 4, whose haploid offspring were used to 
construct the linkage map. The offspring of E14 had the 
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highest percentage of segregating markers as a resuk. This 
may also explain the lack of QTL discovered in F13; more 
markers were segregating for F14, giving more opportuni- 
ties to discover QTL. Future studies should incorporate 
multiple haploid families for mapping to diversify the 
number of segregating markers among families. 

Broader implications 

Why is understanding of the genetic basis of adaptation 
increasingly important? Some species and populations may 
be able to adjust to climate warming if the change is slow, 
accumulating over centuries or possibly even several dec- 
ades. However, many wild populations will less likely adapt 
in the face of rapid change. Climate records show that the 
Pacific Northwest corridor of North America is warming, 
substantially faster than the global average; warming rates 
of 0.1-0.6°C per decade are anticipated during the coming 
century (ISAB 2007). These changes will alter snow pack, 
seasonal stream flows and temperatures, and water quality: 
all factors that affect locally adapted species including popu- 
lations of Pacific salmon. Projections suggest that warming 
wiU cause major segments of territory for Pacific salmon in 
North America to become fragmented or inhabitable within 
just a few decades (Battin et al. 2007; IPCC 2008). 

Genetic differences between populations mediate diverse 
responses to climate change (Peterman et al. 1998; Crozier 
and Zabel 2006; Rogers and Schindler 2011). Matala et al. 
(2011) demonstrated that a number of SNPs in Chinook 
salmon were significantly associated with temperature and 
that the temperature-associated loci performed better at 
assigning populations to region and climate than did neu- 
tral markers. In Washington State, USA, a population of 
native Chinook salmon was discovered in Lower Crab 
Creek, a creek thought unsuitable for their survival. Tem- 
peratures in the creek regularly exceed those predicted to 
be lethal for Chinook salmon, but the habitat was found to 
support a native, breeding population that is uniquely 
adapted to survival under the harsh conditions (Small et al. 
2011). However, the genetic mechanisms underlying this 
potential for adaptation remain poorly understood. 

We believe that genomic data, especially data that iden- 
tify adaptively important loci, will provide key information 
to policy makers who wiU establish ranking criteria for pri- 
oritizing populations for conservation (Clarkson et al. 
2012; Hutchings et al. 2012). GBS approaches wiU provide 
an efficient mechanism for enabling such data (reviewed in 
Allendorf et al. 2010). Contemplating such data sets in the 
context of high-density maps is already providing impor- 
tant insights into the genetics and genomic architecture of 
adaptively important traits (Hecht et al. 2012; Gagnaire 
et al. 2013a). However, classical mapping exercises may 
take two or more generations; the doubled haploid lines 
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used in Miller et al. (2012) were developed over two dec- 
ades (Parsons and Thorgaard 1984; Lucas et al. 2004). The 
mapping of haploids as done here provides a comparatively 
rapid approach to acquire robust genomics data. 

Our map already provided a basis to locate genomic 
regions that are candidates for selection in five population 
of Chinook salmon from Alaska, although QTL reported 
here were not implicated (Larson et al. 2013). That study 
also utilized the linkage relationships of the RAD tags to 
calculate effective population size, a novel method for non- 
model organisms. It is also important to note that employ- 
ing haploids for mapping enables the immediate 
categorization of paralogous sequence variants in the dupli- 
cated salmonid genome (SprueU et al. 1999), a factor that 
greatly complicates interpretation of next-generation geno- 
typing data (Seeb et al. 2011). 

We believe that the use of GBS coupled with haploid 
mapping will expedite applications of genomic information 
by conservation and management agencies. Many interested 
in salmonids are already aggressively seeking genomics data 
to improve understanding of adaptively important loci 
(Hale et al. 2013; Hecht et al. 2013; Larson et al. 2013). 
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